Eurostatv2 pr #1094

shapateriya · 2024-10-17T15:35:21Z

No description provided.

beets · 2024-10-17T22:00:56Z

Thanks for the PR and getting this import done quickly!

A few initial comments:

Please run the formatter (./run_tests.sh -f)
Please add details to the PR description. Specifically, I'd like to see:
- Source file URL
- Any commands used to generate these files. The reviewer (or anyone after) should be able to duplicate the output files based on the instructions here.
- Any additional manual updates done as part of the import.
- Verifications done on the output (please include artifacts where possible)

In the future, all scripts should be accompanied by tests. We can skip it this time due to the time crunch, but I would like us to come back to revisit this in the next month.

fyi @manishvats2

beets

thanks! these changes look good, especially the updates on measurement methods and stat vars. i do want us to think through the update on Count_Person_Employed --> dc/nm9hcklgg5zb3 (/cc @ajaits @hareesh-ms)

please use git lfs for the input and output tsv / csv's. we should have examples of these in the repo

beets · 2024-10-17T23:22:42Z

scripts/eurostat/regional_statistics_by_nuts/birth_death_migration/import_data.py

@@ -126,8 +129,15 @@ def clean_data(preprocessed_df, output_path):

 # replace colon with NaN.
 clean_df = clean_df.replace(':', '')
-
- clean_df['geo'] = 'dcid:nuts/' + clean_df['geo']
+ # for ind, geo in enumerate(clean_df['geo']):


please remove the commented out code

beets · 2024-10-17T23:25:27Z

...s/eurostat/regional_statistics_by_nuts/employed_per_sector/emp_persec_preprocess_gen_tmcf.py

@@ -37,24 +40,37 @@
 'Count_Person_Employed_NACE/O-Q',
 'Count_Person_Employed_NACE/O-U',
 'Count_Person_Employed_NACE/R-U',
- 'Count_Person_Employed',
+ 'dc/nm9hcklgg5zb3',


please add a comment that this is "Population: Employed"

beets · 2024-10-17T23:27:25Z

scripts/eurostat/regional_statistics_by_nuts/gdp/import_data.py

@@ -72,7 +82,12 @@ def download_data(self):
 """Downloads raw data from Eurostat website and stores it in instance
 data frame.
 """
- self.raw_df = pd.read_table(self.DATA_LINK)
+ # self.raw_df = pd.read_table(self.DATA_LINK)


please remove

beets · 2024-10-17T23:27:29Z

scripts/eurostat/regional_statistics_by_nuts/gdp/import_data.py

+ self.raw_df = pd.read_table("nama_10r_3gdp.tsv.gz")
+ self.raw_df = self.raw_df.rename(columns=({'freq,unit,geo\TIME_PERIOD': 'unit,geo\\time'}))
+ self.raw_df['unit,geo\\time'] = self.raw_df['unit,geo\\time'].str.slice(2)
+ # return raw_df


please remove

beets · 2024-10-17T23:27:38Z

scripts/eurostat/regional_statistics_by_nuts/gdp/import_data.py

@@ -174,13 +191,16 @@ def generate_tmcf(self):
 assert col in ['geo', 'time']
 continue
 col_num += 1
+ # Amount_EconomicActivity_GrossDomesticProduction_Nominal_AsAFractionOf_Count_Person


please remove

beets · 2024-10-17T23:28:14Z

...stat/regional_statistics_by_nuts/population_density/PopulationDensity_preprocess_gen_tmcf.py

@@ -95,5 +110,9 @@ def get_template_mcf():


 if __name__ == "__main__":
+ # _DATA_URL = "https://ec.europa.eu/eurostat/estat-navtree-portlet-prod/BulkDownloadListing?file=data/demo_r_d3dens.tsv.gz"


please remove

shapateriya added 2 commits October 17, 2024 15:08

eurostatv2_pr

bdbe359

eurostatv2_pr

c33e6f4

blunderbuss-gcf bot assigned beets Oct 17, 2024

beets reviewed Oct 17, 2024

View reviewed changes

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Eurostatv2 pr #1094

Eurostatv2 pr #1094

shapateriya commented Oct 17, 2024

beets commented Oct 17, 2024 •

edited

Loading

beets left a comment

beets Oct 17, 2024

beets Oct 17, 2024

beets Oct 17, 2024

beets Oct 17, 2024

beets Oct 17, 2024

beets Oct 17, 2024

		@@ -95,5 +110,9 @@ def get_template_mcf():


		if __name__ == "__main__":
		# _DATA_URL = "https://ec.europa.eu/eurostat/estat-navtree-portlet-prod/BulkDownloadListing?file=data/demo_r_d3dens.tsv.gz"

Eurostatv2 pr #1094

Are you sure you want to change the base?

Eurostatv2 pr #1094

Conversation

shapateriya commented Oct 17, 2024

beets commented Oct 17, 2024 • edited Loading

beets left a comment

Choose a reason for hiding this comment

beets Oct 17, 2024

Choose a reason for hiding this comment

beets Oct 17, 2024

Choose a reason for hiding this comment

beets Oct 17, 2024

Choose a reason for hiding this comment

beets Oct 17, 2024

Choose a reason for hiding this comment

beets Oct 17, 2024

Choose a reason for hiding this comment

beets Oct 17, 2024

Choose a reason for hiding this comment

beets commented Oct 17, 2024 •

edited

Loading